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We develop a theory for the market impact of large trading orders, which we call 
metaorders because they are typically split into small pieces and executed incremen- 
tally. Market impact is empirically observed to be a concave function of metaorder 
size, i.e. the impact per share of large metaorders is smaller than that of small 
metaorders. We formulate a stylized model of an algorithmic execution service and 
derive a fair pricing condition, which says that the average transaction price of the 
metaorder is equal to the price after trading is completed. We show that at equilib- 
rium the distribution of trading volume adjusts to reflect information, and dictates 
the shape of the impact function. The resulting theory makes empirically testable 
predictions for the functional form of both the temporary and permanent compo- 
nents of market impact. Based on the commonly observed asymptotic distribution 
for the volume of large trades, it says that market impact should increase asymptot- 
ically roughly as the square root of size, with average permanent impact relaxing to 
about two thirds of peak impact. 
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I. INTRODUCTION 



Market impact is the expected price change conditioned on initiating a trade of a given 
size and a given sign. Understanding market impact is important for several reasons. One 
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motivation is practical: To know whether a trade will be profitable it is essential to be 
able to estimate transaction costs, and in order to optimize a trading strategy to minimize 
such costs, it is necessary to understand the functional form of market impact^. Another 
motivation is ecological: Impact exerts selection pressure against a fund becoming too large, 
and therefore is potentially important in determining the size distribution of funds^. Finally, 
an important motivation is theoretical: Market impact reflects the shape of excess demand, 
the understanding of which has been a central problem in economics since the time of Alfred 
Marshall. 

In this paper we present a theory for the market impact of large trading orders that are 
split into pieces and executed incrementally. We call these metaorders^ . The true size of 
metaorders is typically not public information, a fact that plays a central role in our theory. 
The strategic reasons for incremental execution of metaorders were originally analyzed by 



Kyle (1985), who developed a model for an inside trader with monopolistic information 



about future prices. Kyle showed that the optimal strategy for such a trader is to break her 
metaorder into pieces and execute it incrementally at a uniform rate, gradually incorporating 
her information into the price. In Kyle's theory the price increases linearly with time as the 
trading takes place, and all else being equal, the total impact is a linear function of size. 



The prediction of linearity is reinforced by Huberman and Stanzl (2004) who show that. 



providing liquidity is constant, to prevent arbitrage permanent impact must be linear. 

Real data contradict these predictions: Metaorders do not show linear impact. Empirical 
studies consistently find concave impact, i.e. impact per share decreases with size^. It is in 
principle possible to reconcile the Kyle model with concave dependence on size by making the 
additional hypothesis that larger metaorders contain less information per share than smaller 
ones, for example because more informed traders issue smaller metaorders^. A drawback of 
this hypothesis is that it is neither parsimonious nor easily testable, and as we will argue 
here, under the assumptions of our model it violates market efficiency. 



^ See Bertismas and Lo ( |1998[ ), Almgren and Chriss ( |1999|2000| , Almgren ( |2003| , Almgren, Thum, and 



Hauptmann (2005), and Obizhaeva and Wang (2005). 
^ See for example Berk and Green (2004). Schwartzkopf and Farmer (2010) have shown that managers of 



large mutual funds offset increases in market impact by lowering fees, slowing down trading and diversi- 
fying assets. 

Other names used in the literature are large trades, packages, or hidden orders. 
^ An early study of metaorders, which links together the individual trades coming from a given client, is 



Chan and Lakonishok (1993 1995). Later studies by Torre (1997), Almgren et al. 2005 Engle et al. 



(2008), and Moro et al. (2009) find concave temporary impacts roughly consistent with a square root 



functional form. The functional form of permanent impact is harder to measure and more controversial. 
These studies should be distinguished from the large number of studies of the market impact of individual 
trades or the sum of trades in a given period of time, that do not attempt to link together the individual 



trades coming from a given client. See Hasbrouck ( 1991 ), Hausman, Lo and MacKinlay ( 1992 ), Keim and 



Madhavan (1996), Torre (1997), Kempf and Korn (1999), Plerou et al. (2002), Evans and Lyons (2002) 



Lillo, Farmer, and Mantegna (2003), Potters and Bouchaud (2003), Gabaix et al. (2003 2006), Chordia 



and Subrahmanyam 2004 Farmer, Patelli and Zovko (2005), Weber and Rosenow (2006), and Hopman 



(2006). 

^ Tins results in concave dependence on size but preserves linear dependence as a function of time. Our 
model in contrast predicts that both size and time follow the same concave functional form. The empirical 
results strongly support concave dependence on size, whereas the dependence on time is an open question. 



4 



Huberman and Stanzl are careful to specify that linearity only applies when liquidity 
is constant. In fact, liquidity fluctuates by orders of magnitude and has a large effect on 
price fluctuations^. Empirical studies find that order flow is extremely persistent, in the 
sense that the autocorrelation of order signs is positive and decays very slowly. No arbitrage 
arguments imply either fluctuating asymmetric liquidity as postulated by Lillo and Farmer 
(2004), or no permanent impact, as discussed by Bouchaud et al. (2004)^. 



The central goal of our model is to understand how order splitting affects market impact*. 
Whereas Kyle assumed a single, monopolistic informed trader, our informed traders are 
competitive. They submit their orders to an algorithmic execution service that bundles 
them together as one large metaorder and executes them incrementally. We show that this 
leads to a symmetric Nash equilibrium satisfying the condition that the final price after a 
metaorder is executed equals its average transaction price. We call this condition fair pricing, 
to emphasize the fact that under this assumption trading a metaorder is a breakeven deal 
- neither party makes a profit as a result of trading^. Our equilibrium is less general than 
Kyle's in that it assumes uniform execution, but it is more general in that it allows an 
arbitrary information distribution. This is key because, as we show, there is an equilibrium 
between information and metaorder size, making it possible to match the metaorder size 
distribution to empirical data. 

Combining the fair pricing condition with a martingale condition makes it possible to 
derive the price impact of metaorders as a function of the metaorder size distribution. This 
allows us to make several strong predictions based on a simple set of hypotheses. For a 
given metaorder size distribution it predicts the average impact as a function of time both 
during and after execution. We thus predict the relationship between the functional form 
of two observable quantities with no a priori relationship, making our theory falsifiable in 
a strong sense. This is in contrast to theories that make assumptions about the functional 
form of utility and/or behavioral or institutional assumptions about the informativeness of 
trades, which typically leave room for interpretation and require auxiliary assumptions to 
make empirical tests. 



Farmer et al. (2004) show that fluctuations in instantaneous liquidity can span as much as three orders 



of magnitude for the same equity in the course of a year; Gillemot et al. ( 2005 1 show that liquidity fluctu- 



ations dominate volume fluctuations in driving clustered volatility. Some of the theoretical consequences 



of time varying liquidity have been investigated by Challet (20071 and Gatheral (2010). 
^ Asymmetric liquidity means that the price response to a buy order differs from the price response to a 
sell order of the same size. The persistence of order flow implies predictability of order signs. If impact 
has a non-zero permanent component, then if the next order is likely to be a buy, no-arbitrage implies 
that the price response to a buy order must be smaller than the price response to a sell order. See also 



Bouchaud et al. (2006), Farmer et al. (2006), Wyart et al. (2006), and Bouchaud, Farmer and Lillo 



(2009). For a precursor of the theory developed here see the PhD thesis of Gerig (20071. For an early 



attempt to derive a theory yielding a square root market impact see Zhang (1999). These theories are 



for the impact of individual transactions, and the answers they give for the time and size dependence of 
impact are quite different than those we derive here for metaorders that are typically composed of many 
individual transactions. 

The persistence of order flow in the London Stock Exchange has been shown to be overwhelmingly due 



to order splitting rather than herding by Toth et al. (2011 1. 
^ Neither party makes a profit in a one way trade based on current prices, but for a round trip the aggressive 
party nonetheless incurs losses equal to the one way permanent impact. 
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For example, Gabaix et al. (2003 2006) have also argued that the distribution of trading 
volume plays a central role in determining impact, and have derived a formula for impact that 
is concave under some circumstances. However, in contrast to our model, their prediction 
for market impact depends sensitively on the functional form for risk aversion^°. Our theory, 
in contrast, is based entirely on market efficiency and does not depend on the functional 
form of utility. 

Our work here is related to several papers that study market design. Viswanathan and 
Wang (2002), Glosten (2003), and Back and Baruch (2007) derive and compare the equi- 
librium transaction prices of orders submitted to markets with uniform vs. discriminatory 
pricing. Depending on the setup of the model, these prices can be different so that investors 
will prefer one pricing structure to the other and can potentially be "cream-skimmed" by 
a competing exchange^^. The fair pricing condition we introduce here forces the average 
transaction price of a metaorder (which transacts at discriminatory prices) to be equal to 
the price that would be set under uniform pricing. Fair pricing, therefore, means investors 
have no preference between the two pricing structures, and they have no incentive to search 
out arrangements for better execution. On the surface, this result is similar to the equiva- 
lence of uniform and discriminatory pricing in Back and Baruch (2007). However, in their 
paper, this equivalence results because orders are always allowed to be split, whereas ours 
is a true equivalence between the pricing of a split vs. unsplit order. 

In Section II we give a description of the model and discuss its interpretation. In Section 
III we develop the consequences of the martingale condition and show how this leads to zero 
overall profits and asymmetric price responses when order flow is persistent. In Section IV 
we show that any Nash equilibrium must satisfy the fair pricing condition. In Section V we 
derive in general terms what this implies about market impact. In Section VI we introduce 
speciflc functional forms for the metaorder size distribution and explicitly compute the 
impact for these cases. Finally, in Section VII we discuss the empirical implications of the 
model and make some concluding remarks. 



II. MODEL DESCRIPTION 



We study a stylized model of an algorithmic trading service combining and executing 
orders of long-term traders. This can be thought of as a broker-dealer receiving multiple 
orders on the same security and executing them algorithmically at the same time, or as 
an institutional trading desk of a large asset manager combining the orders from multiple 
portfolio managers into one large metaorder. Our goal is to model the price impact of a 
metaorder during a trading period in which it may or may not be present. We set up the 
model in a stylized manner as a game in which trading takes place across multiple periods, 
after which flnal prices are revealed. While this is somewhat artiflcial in comparison to a real 
market (which has no such thing as a "flnal" price), the framework is simple enough to allow 
us to flnd a solution, and the basic conclusions should apply more broadly. The structure 



Gabaix et al. argue that if risk aversion is proportional to cr^ , where a is the standard deviation of profits, 
the impact will increase with the size N of the metaorder as N^^^. Thus the impact is concave if (5 < 2, 
linear if 5 = 2 (i.e. if risk is proportional to variance), and convex otherwise. For another theory that also 



predicts concave impact see Toth et al. (2011) 
For example, see Bernhardt, Hughson, and 



aganathan ( 2002 1 
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of the model is in many respects similar to the classic framework of Kyle (1985), but with 



several important differences. (A point-by-point comparison is made in the conclusions). 



A. Framework 

We begin with an overview. There is a single asset which is traded in each of periods, 
which can be regarded as a game. There are three kinds of agents, long-term traders, 
market makers and day traders. The long-term traders have a common information signal 
that is received before the game starts; based on this information they formulate their 
orders and submit them to an algorithmic trading service. The algorithmic service operates 
mechanically, dividing the bundle of orders (which we call a metaorder) into equal sized 
lots which are submitted as anonymous market orders in each successive period until the 
metaorder is fully executed. The day traders, in contrast, receive a new information signal 
and submit orders based on this signal in every period of the game. In every period each 
market maker observes the netted order of the long-term and day traders and submits a quote 
accordingly; all orders are filled at the best price. The game ends with a final liquidation 
period at time step A + 1 in which prices are set exogenously to reflect the accumulated 
information. The length A of the game varies based on the amount of information received, 
and is unknown to the market makers. 

We now discuss the setup of the game in more detail, beginning with a description of 
each of the agents. The long-term traders receive a common information signal a before the 
game begins, which only they observe. In order to model situations in which there may or 
may not be a metaorder present, we allow a nonzero probability that a = 0. Thus with 
probability /i the signal a is drawn from a distribution p{a), which has nonzero support 
over a continuous interval —amax < « < <ymax, where < amax < oo, and with probability 
(1 — /i) there is no information, i.e. a = 0. For simplicity we discuss the case where the 
draw of a happens to be positive, i.e. it causes the final price to increase, but the results 
apply equally well if a is negative. 

After a is revealed the orders of the long-term traders are aggregated together into a 
metaorder and executed in a package. The bundling and execution process can be thought 
of as representing an algorithmic trading firm, broker, or institutional trading desk. There 
are /C long-term traders, labeled by an index k = 1,2, . . . ,/C, where /C is a large number. 
After the common information signal is received each long-term trader submits an order of 
size < nk{a) < M, where M{amax) is a large positive integer. Each long-term trader 
decides nfc(a) independently. The individual orders are bundled together into a metaorder 
of size Ylf^i Uk shares. 

The algorithmic trading firm operates purely mechanically, chopping the metaorder into 
equal pieces and submitting market orders at successive times t = 1,...,A. These are 
executed at transaction prices S'^, where 1 < A < M, as illustrated in Figure [Tj We assume 
the metaorder is executed in lots, which for mathematical convenience are chosen to be of /C 
shares. The imposition of a maximum trade size M is a technical detail^^; we assume that 



We have imposed a bound on a, which induces a bound on A/, to avoid mathematical problems that 
occur in the limit M — > oo. This is explained in the Appendix. For the typical situations we have in 
mind M, K,, and {M — 1)//C are all large numbers. The choice of lots of size K, shares is purely a matter 
of convenience; the assumption that the lots have a constant size is something we hope can be relaxed in 
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t=0 t=l t=2 t=3 t=M-l t=M t=M+l 

FIG. 1: The tree of possible price paths for a buy metaorder for different sizes N. For this figure 
we assume the metaorder is present and show only expected price paths, averaged over the day 
trader's information. The price is initially 5*0; after the first lot is executed it is Si = Sq + Rq. If 
= 1 it is finished and the price reverts to S2 = Si — Ri, but if > 1 another lot is executed and 
it rises to S2 = Si + Ri. This proceeds similarly until the execution of the metaorder is completed. 
At any given point the probability that the metaorder has size N > t, i.e. that the order continues, 
is Vt- If had we followed a typical price path under circumstances when the day trader's noisy 
information signal is large, rather than the expected price paths shown here, the sequence of prices 
would be a random walk with a time- varying drift caused by the metaorder 's impact. 

M is large. If the metaorder is present (i.e. if a 7^ 0), trading ends when the metaorder 
is fully executed, i.e. when = (^f^ifiki/K,- The equilibrium distribution of metaorder 
lengths is ptv. If a = we randomly choose a value of N from an arbitrary distribution 
(which in general differs from Pn)- 

The day traders can be treated as a single representative agent who receives a private 
information signal rjt at the beginning of each period t of the game, and submits a market 
order (either to buy or sell) of size F{T]t), where rjt is a zero mean IID noise process with 
an otherwise arbitrary distribution P{rit), and F is an increasing function^^. There is no 
restriction on the size of 77, and in particular we allow the possibility that rjt is large and 
negative while a is positive, so that during the course of execution of the metaorder the 
combined order flow may change sign, and that at different times the market makers may 
buy or sell. 

At each time step the market orders of the long-term traders and the day trader are netted 



the future. 

The precise functional form of F is not important here. We assume the day traders do not engage in 
order splitting, i.e. they trade on the information they receive in given time step only in that time step. 
This guarantees that in the absence of a metaorder the day trader's order flow provides a sufficient signal 
for a market maker to infer the new information. 
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and the single combined order is submitted to the market makers, who are competitive and 
profit maximizing. We are not assuming any special institutional privileges, such as those of 
the specialists in the NYSE; our market makers are simply competitive liquidity providers. 
At each time step each market maker observes the combined order and submits a quote. The 
combined order is fully executed by the market maker(s) offering the best price. The market 
makers are able to take past order flow and prices into account in setting their quotes. Given 
that Tjt is IID, the typical transaction price sequence {S'^} will look like a random walk, as 
the market maker responds to the day trader's order flow, with a possible superimposed 
drift if a metaorder is present. 

In the final period t = N + 1 the combination of the long-term information a and the 
accumulated short term information are revealed. Since the long-term traders' information 
signals are independent of those of the day trader, information is additive and the final price 
is 

N 

S'^^^ = So + a + ^rit, (1) 
t=i 

where 5*0 is the initial price. The average final price is SN^i{a) = E[S'j^_^_^\a] = So + a, 
where E implies an average over the noise r)t, 

E[x] = J xP{ri)dr]. (2) 

Similarly the average transaction price at time t is St{a) = E[S^\a\. The goal of the paper 
is to compute the average immediate impact Xt = Sf — Sq and the average permanent impact 
/jv = S^+i — Sq of the metaorder. The corresponding incremental average impacts are 
Rt = St — St+i and Rf — St+i — St- As we will show, at the equilibrium the average impacts 
do not depend on /x, p or P. 

The number of long-term traders /C is fixed and is common knowledge. The market 
makers know the initial price 5*0, the information distributions p{a) and P{r]t), they can 
deduce the function F relating the day trader's information to their order size, and they 
observe the net combined order in each period and remember previous order flow. However 
they do not know the information signals a or r]t, and thus they do not know how much 
of the order fiow to ascribe to the long-term trader vs. the day trader. They do not know 
whether a metaorder is present, and if it is, they do not know its size. 

Perhaps the strongest assumption we have made concerns knowledge of the timing of 
metaorders. The market makers know the period t, and as a result if a metaorder is present, 
they know when it started. In typical market settings order fiow is anonymous and the 
starting time of a metaorder is uncertain. Nonetheless, for long meta-orders and sufficiently 
high participation rates the starting time can be inferred from the imbalance in order flow^^. 



The starting time can be inferred by treating order signs as a binomial random process. The imbalance 
required to reject the null hypothesis of unbiased order flow is qVi, where q is the desired number of 

standard deviations of statistical significance. The accumulated imbalance after t steps is zt, where z is 
the participation rate. Equating these gives t — (q/z)^. Hiding an order of size N requires z < q/y/N. 
Thus larger metaorders need to be executed more slowly to avoid detection. Since the time needed to 
complete execution is inversely proportional z, for a large metaorder this can become prohibitive - it is 
impossible to escape detection. 
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Assuming a typical participation rate of 20% and two standard deviations to reject the null 
hypothesis of balanced order flow means that a metaorder can be detected after about 100 
timesteps. Large metaorders are frequently executed in 1000 or even 10, 000 lots^^, implying 
an error in inferring the starting time of 1% to 10%. Thus, it is not unrealistic to imagine 
that market makers can infer the presence of large metaorders and estimate their starting 
times. 

We are not concerned here with the question of whether or not the algorithmic execution 
service's strategy of splitting the order into equal pieces is an optimal strategy. Our goal 
is instead to assume that this is what they do, and to derive the implications for price 
impact. Our main result is to derive the equilibrium relationship between the metaorder 
size distribution and the average immediate and permanent impacts It and 1^. 



B. Uncertainty of metaorder size and persistence of order flow 

In addition to taking expectations over the day trader's noise 77, which we denote by E, we 
must compute expectations about the length of the game. The crux of our argument hinges 

around the market makers' ignorance of a; when a ^ this translates into uncertainty about 
metaorder size. We will use the notation Ef to represent an average over all metaorders of 
size N > t, and as described above, E for averages over rjt. For a generic function the 
average over metaorder sizes is 

EM f ^M-t r 
_ N=t PnJN _ Z^i=o Pt+iJt+i /o\ 
^tUN\ - - „M-t ' i-^i 

where in the last term we made the substitution N = t + i. 

Assuming that a metaorder is present, the likelihood that it will persist depends on the 
distribution and the number of executions t that it has already experienced. Let Vt be 
the probability that the metaorder will continue given that it is still active at timestep t. 
This is equivalent to the probabihty that it is at least of size t, i.e. 

rt - ^S^. (4) 

This makes precise how order splitting can make order flow positively auto correlated. In 
particular, if pn has heavy tails than an exponential Vt will increase with time and induce 
persistence in order flow. 

In the case where no metaorder is present one can similarly define the probability that 
the game continues as 

l^i=t+lPi 



EM 
i=tPi 



Metaorders can be extremely large. For example, the New York Times recently reported that Warren 
Buffet buffet took 8 months to buy a 5.5% share of IBM. 
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III. MARTINGALE CONDITION 

In this section we introduce a martingale condition and discuss its implications for liq- 
uidity and overall profitability. 

A. Derivation of martingale condition 

Market makers must set prices given only past and present order flow information, without 
knowing whether the order flow originated from a metaorder or from day traders. Their 
decision function is of the form 

= /(s/C + F{rit), sK + F(77t_i), . . . , s/C + F{r^,)), 

where s = sign [a] . As we show below, we are able to finesse this difficult problem by imposing 
a martingale condition and averaging over the day traders' noise, which is sufficient for the 
main goal of this paper of deriving the equilibrium between the impacts and pjy- 

During the game the market makers may also use information from order flow and prices 
to update their prior probability /x for the presence of a metaorder to a more accurate value 
We assume that transaction prices are a martingale, so that the current transaction 
price S[ is equal to the expected future price. Define an indicator variable m with m = T 
is the metaorder is present and m = F if it is absent. For the price in the next period it is 
necessary to average over four possibilities: 

1. With probability n' the metaorder is present and with probability Vt trading continues. 
In this case ^[^^+1!^^, m = T] = S't + R't, where R'^ = E[{S't^^ - S't)\S't, m = T]. 

2. With probability //' the metaorder is present and with probability 1 — Vt time t is 
the last trading period. In this case E[S^_^i\S[,'m — T] — S'^ — R'f., where = 
E[{Si^,-S',)\Si,m = T]. 

3. With probability 1 — /x' the metaorder is not present and with probability Vt trading 

continues. Since the day trader's information r]t is zero mean and IID, the expected 
average transaction price on the next time step conditioned on the current transac- 
tion price must satisfy E[S[_^i\Sl,m = F] = S[, i.e. the average transaction price is 
unchanged. 

4. With probability 1 — fi' the metaorder is not present and with probability I — Vt time 
t is the last trading period. For similar reasons m = F] = S'^, i.e. the 
average final price conditioned on the current transaction price is equal to the current 
transaction price. 

Thus the martingale condition can be written 

S', = {Vtm+i\S't,m^ T] + {1- Vt)m+i\S't,m^ T]) 
+ (1 - (Vtm^,\Si, m = F] + (1 - Vt)m^,\Si, m = F]) 
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Substituting for the average next price conditioned on the current price for cases 1-4 gives: 

S[ = {1- i,[)~S[ + ii[ (VtCS't + R't) + (1 - Vt)iS[ - R[)) . (5) 

The current transaction price S'^ and fi' both cancel and this reduces to 

VtR[ - (1 - VM = 0. 

Since rjt is mean zero and IID, E[R'f\ = Rt = St+i — St and similarly for R[. Taking averages 
over rjt gives 

VtRt - (1 - Vt)Rt = 0. (6) 

Equation ^ holds for t = l,2,..,M — 1. If the metaorder has maximal length at the end of 
the Mth interval by definition Vm = 0, which implies that Sm = Sm+i- 

Let us pause for a moment to digest this result. We started with a martingale condition 
for realized prices, including fluctuations caused by day traders, and then reduced it to a 
martingale condition for the average impact due to the presence of a metaorder. The reduced 
martingale no longer depends on Ht, pj^, or the day trader's information. The fact that we 
assume a martingale for realized prices implies that arbitrage of the impact is impossible. 

The ability to average away the day traders is a consequence of our assumption that a 
and rjt are independent, which implies additivity of information. This separates the problem 
of the metaorder's impact from that of the day trader's impact - the metaorder's impact 
effectively rides on top of the day trader's impact. As we will see, the virtue of this approach 
is that it allows us to infer quite a lot without needing to solve for the market makers' optimal 
quote setting function /. 



B. Asymmetric price response 

Equation (|6| can be trivially rewritten in the form 

^ ^ l-Vt 
Rt Vt ' 



(7) 



where Rt = St — St+i and Rt = St+i — St- Thus the martingale condition fixes the ratio of 
the price responses Rt and but does not fix their scale. If Vt is large, corresponding to a 
metaorder that is likely to continue, then Rt/Rt small. This means that the price response 
if the order continues is much less it is than if it stops. 

To complete the calculation we need another condition to set the scale of the price re- 
sponses Rt and Rt, which may change as t varies^^. Such a condition is introduced in 



Section |IV[ Even without such a condition, one can already see intuitively that "all else 
equal", for a heavy-tailed metaorder distribution p^, the impact will be concave. (Recall 
that heavy tails in imply that Vt increases with t). 



Various authors have used alternative conditions. For example, Gerig (2007) used a symmetry condition, 
which can be derived from assumptions of linearity. The fair pricing condition that we derive here has 
the advantage that it can be justified based on equilibrium arguments. 
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C. Zero overall profits 

The martingale condition implies that the market makers break even overall, i.e. that 
their total profits summing over metaorders of all sizes is zero. This is stated more precisely 
in proposition one. 

Proposition 1. The market makers transact lots at average prices {St}, which later 
are all valued at a final price Sn+i- The martingale condition implies zero overall profits, 
i.e. 

M 

n = Ei[kn] = J2 ^^NPN = 0, (8) 

Ar=l 

where 

1 ^ 

t=i 

is the profit per lot transacted. The proof of proposition 1 is given in Appendix A. The phrase 
"overall profits" emphasizes that the martingale condition only implies zero profits when 

averaged over metaorders of all sizes. It allows for the possibility that the market makers 
may make profits on metaorders in a given size range, as long as they take corresponding 
losses in other size ranges. 

Surprisingly, Proposition 1 is not necessarily true when M is infinite. The basic problem 
is similar to the St. Petersburg paradox: As the metaorder size becomes infinite it is possible 
to have infinitely rare but infinitely large losses. The conditions under which this holds are 
more complicated, as discussed in Appendix A. 

IV. FAIR PRICING 

We now derive the fair pricing condition, which states that for any N 

1 ^ 

7r;v = ^ J]^i--5;v+i = 0. (10) 

Under fair pricing the average execution price is equal to the final price. We call this 
fair pricing for the obvious reason that both parties would naturally regard this as "fair". 
Fair pricing implies that the market makers break even on metaorders of any size, as op- 
posed to the martingale condition, which only implies they break even when averaging over 
metaorders of all sizes. 

A. Nash equilibrium 

Proposition 2. If the immediate impact Xt has a second derivative hounded below zero^'^ , 
in the limit where the number of informed traders fC oo, any Nash equilibrium must satisfy 



Note that this is sufficient for concavity. 
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the fair pricing condition ttjv = for 1 < N < M . On average market makers profit from 
orders of length one and take (equal and opposite) losses from orders of length M. 

This result is driven by competition between informed traders. All informed traders 
receive the same information signal a, and the strategy of informed trader k is the choice 
of the order size nk{a). The orders are then bundled together to determine the combined 
metaorder size N = X]a;=i ^'!:('^)- '^^^ decision of each informed trader is made without 
knowing the decisions of others. 

The derivation has two steps: first we examine the case vr^r ^ for 1 < < M, and 
show that if others hold their strategies constant, providing the impact is concave and /C is 
sufficiently large, traders can increase profits by changing strategy. Secondly we show that if 
ttat = there is no incentive to change strategy. Then we return to examine the cases = 1 
and A^ = M, which must be treated separately. The derivation is given in the Appendix. 

In contrast to the martingale condition, which only implies that immediate profits are 
zero when averaged over size, fair pricing means that they are identically zero for every 
size. It implies that no one pays any costs or makes any profits simply by trading in any 
particular size range. 

The Nash equilibrium is symmetric, i.e. all agents make the same decision. (This must 
be in any case since there is nothing to distinguish them). This means that there is a unique 
order size n{a) = nk{a) for any a, and the distribution of information p{a) implies the 
distribution of metaorder size pn- We can use pn as a proxy for p{a), which is the key fact 
allowing us to state our results in terms of the observable quantity pn rather than p{a), 
which is much more difficult to observe. 

Although this derivation is based on rationality, the fair pricing condition potentially 
stands on its own, even if other aspects of rationality and efficiency are violated. In modern 
markets portfolio managers routinely receive trade cost analysis reports that compare their 
execution prices relative to the close, which for a trade that takes place over one day is a good 
proxy for S^^i. Such reports are typically broken down into size bins, making persistent 
inconsistencies across sizes clear. Execution times for metaorders range from less than a 



day to several months (Vaglica et al., 2008), and are much shorter than typical holding 



times, which for mutual funds are on average a year and are often much more (Schwartzkopf 



and Farmer, 2010). Thus the statistical fiuctuations for assessing whether fair pricing holds 



are much smaller than those for assessing informational efficiency. Since ttat is fairly well 
determined, portfolio managers will exert pressure on their brokers to provide them with 
good execution. As a result we expect the fair pricing condition to be obeyed to a higher 
degree of accuracy than the informational efficiency condition. 



V. GENERAL EXPRESSIONS FOR IMPACT 

Although we have derived the Nash equilibrium only in the case where the immediate 
impact is concave, for the remainder of the paper we will simply assume that the martingale 
condition holds for all A^ and the fair pricing condition holds for 1 < A^ < M. This allows 
us to derive both the immediate impact In and the permanent impact In for any given 
metaorder size distribution pn- We later argue that for realistic situations the metaorder size 
distribution gives rise to a concave impact function, consistent with the Nash equilibrium. 
The martingale condition (Eq. |6]) and the fair pricing condition (Eq. 10) define a system 



of linear equations for St and St at each value of t, which we can alternatively express in 
terms of the price differences Rt = St+i — St and Rt = St — St+i, where t = 1, . . . , M. 
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The martingale condition holds for t = 1,...,M and the fair pricing condition holds for 
t = 2, ... ,M — 1. There are thus 2M — 2 homogeneous linear equations with 2M — 1 
unknowns^*^. Because the number of unknowns is one greater than the number of conditions 
there is necessarily an undetermined constant, which we choose to be Ri. 

Proposition 3. The system of martingale conditions (Eq. ^ and fair pricing conditions 



(Eq. 10) has solution 



Rt = -^ ^^^1 t = 2,3,...,M-l (11) 



^^ = T~Pt^' t = l,2,...,M-l (12) 

The proof is given in Appendix A. 

An important property of the solution is the equivalence of the impact Xt as a function 
of either time or size. This is in contrast to the prediction of an "extended" Kyle model 
under the assumption that traders of different sizes are differently informed, which yields 
linear impact as a function of time, but allows the slope to vary nonlinearly with size. 



A. General solution for immediate impact 



Summing Eq. 11 implies that for N > 2 the immediate impact is 



t-i 



It = St- So = Ro + Riil + J^l^^ ^a/^ I ' (13) 

V k=2'^l^i=k+lPil^i=kPiJ 

For t = 1 the immediate impact is Xi = Si — So = Rq = Rq and for t = 2 it is X2 = S2 — Sq = 
Rq + Ri. (The meaning of the undetermined constants Ri and i?o is discussed in a moment). 

B. General solution for permanent impact 

The permanent impact Sn+i — Si is easily obtained. Making some simple algebraic 
manipulations 

Sn+1 = Sn^i — Sn + Sn = Sn — Rn = Sn — :=r—RN- 

i — Hn 



By combining Eqs. (13) and Q we get 

In = Sn+1 - Sq = Rq + Ri ll + J.^^ ^a/^ ~ .r ^ai-n^ 1 • (1^) 

V k=2 ^ 22^=k+l Pi 22i=k P^ N 22^=0 PN+^ ) 



The price S'm+i does not exist, so Rm is not needed. This reduces the number of unknowns by one. 
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C. Setting the scale 

We have expressed both the permanent and immediate impact purely in terms of and 
the undetermined constants Rq and Ri. The undetermined constants can in principle be 
fixed based on the information at the equilibrium. At the equilibrium information signals 
in the range a G [0, ai] will be assigned to metaorders of size one, with an average size ai, 
signals in the range a G (ai, 02] will be assigned to metaorders of size two, with an average 
size 0.2-1 and so on. The scale of the impact is set by the relations 

Ro = h = "1 
Ro + Riz ^ = I2 = a2 

We have used the words "in principle" because, unlike metaorder sizes, information is not 
easily observed. 

Barring the ability to independently measure information, the constants Rq and i?i remain 
undetermined parameters. i?i > plays the important role of setting the scale of the impact. 
The constant Rq, in contrast, is unimportant - it is simply the impact of the first trade, 
before the metaorder has been detected. 



ap{a)da 
ap{a)da 



D. Relation between volume and information 

The power of the theory developed here is that the impact is predicted in terms of p^r, 
which is directly measurable (at least with the proper data). The calculation to set the scale 
shows how the continuous variable a maps onto the discrete variable N. a^v is a discrete 
function whose inverse can be written N{ai). If there is a continuum limit for large A^, the 
two distributions are related by conservation of probability as 

p{a)=pN^. (15) 

So for example, if the empirical metaorder size is p^ is asymptotically Pareto distributed, 
as argued in the next section, P/v ~ N~^^^^\ a = N^~^, and p{a) = Based 
on the empirically observed value (3 ^ 1.5, this gives p{a) = a~^, which means that the 
cumulative scales as P(a > x) ~ x~^. This is what is typically observed for price returns in 



American stock markets (Plerou et al., 1999). 



VI. DEPENDENCE ON THE METAORDER SIZE DISTRIBUTION 

We have so far left the metaorder size distribution p^ unspecified. In this section we 
compute the impact for two examples. The first of these is the Pareto distribution. 



1 



(16) 
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which we argue is well-supported by empirical data^^. The second is the stretched exponen- 
tial distribution, which is not supported by data, but provides a useful point of comparison. 



A. Empirical evidence supporting the Pareto distribution 



There is now considerable accumulated evidence that in the large size limit in most major 



equity markets the metaorder size V is distributed as P{V > v) 



with (3 ^ 1.5. 



Trade size. In many different equity markets for large trades the volume V has been 
observ ed by several groups to be distributed as a power law (Gopikrishnan et al. 
(2000); Gabaix et al. (2006)^°. This relationship becomes sharper if only block trades 



are considered (Lillo et al., 



2005) 



Long-memory in order flow. The signs of order flow in many equity markets are 
observed to have long-memory^^ This means that the transaction sign autocorrelation 
function C{t) decays in time as C(r) ~ r"'^, where < 7 < 1. Under a simple theory 
of order splitting the exponent /3 = 7 + 1, which is in good agreement with the data 
[(Lillo et al. 



2005); (Gerig, 2007); Bouchaud, Farmer, and Lillo (2009) 



Reconstruction of large metaorders from brokerage data. Vaglica et al. (2008) re- 



constructed metaorders Spanish stock exchange using data with brokerage codes and 
found that N is distributed as a power law^^ for large N with (3 ~ 1.7. 



There is thus good evidence that metaorders have a power law distribution, though more 
study is of course needed. 



B. Market impact for Pareto metaorder size 

In this section we derive the functional form of the impact for Pareto metaorder size. We 



do this by using Eq. [T3|in the limit as M — )■ cxd, and return in Section |VI C| and in Appendix 
B to discuss how this is modified when M is finite. 



The notation f{x) ~ g(x) means that there exists a constant C 7^ such that in the Umit x — >■ 00, 
f{x)/g{x) C. We use it to indicate that this relationship is only valid in the limit of large metaorder 
size N . 

The value of (3 is somewhat controversial, however: Eisler and Kertesz (2006) and Racz et al. (20091 
have argued that the correct value of /3 > 2. 



Long memory has been observed in the Paris Stock Market by Bouchaud et al. (20041, in the London and 



New York Stock Markets Lillo and Farmer ( |2004[ ), and in the Spanish Stock Market by Vaglica ( |2008[ )]. We 
use the term long-memory in its more general sense to mean any process whose autocorrelation function 
is non-integrable (Beran, [l994 ). This can include processes with structure breaks, such as that studied 



by Ding, Engle and Granger (19931. 

An interesting point that is relevant for the theory developed here is that the power law behavior of 



metaorder size comes from the heterogeneity of market participants. Vaglica et al. (20081 showed that 



metaorder size distribution for individual brokerages is roughly a lognormal distribution, and that a power 
law only emerges when all the brokerages are combined. 
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While we only care about the asymptotic form for large N, for convenience we assume 
an exact Pareto distribution for all A^, i.e. 

where the normalization constant C{P) is the Riemann zeta function. For the Pareto distri- 
bution the probability Vt that an order of size t will continue is 

where ({s, a) is the generalized Riemann zeta function (also called the Hurwitz zeta func- 
tion). The approximations are valid in the large t limit. 



1. Immediate impact 



The immediate impact can be easily calculated from Eq. (13). For Pareto distributed 



metaorder sizes, using Eqs. I\12n and (17), Rt is 



- + ^ai+Mai+At+i)) ^ ^^^^ 

Thus the immediate impact Xt = St — So behaves asymptotically for large t as 

\ log(t + l) for/3 = l ^^"^ 

The exponent /3 has a dramatic effect on the shape of the impact. For Lorenzian distributed 
metaorder size (/3 = 1) the impact is logarithmic, for /3 = 1.5 it increases as a square root, 
for /3 = 2 it is linear, and for /3 > 2 it is superlinear. Thus as we vary /3 the impact goes from 
concave to convex, with /3 = 2 as the borderline case^^. Figure [2] illustrates the reversion 
process for /3 = 1.5 and shows how the shape of the impact varies with /3. 



2. Permanent impact 



The permanent impact under the Pareto assumption is easily computed using Eq. (14). 
A direct calculation shows that^^ 



1 1 



r^~-y xf'-'dx = -N^-\ (21) 



The reason /3 = 2 is special is that for /3 < 2 the second moment of the Pareto distribution is undefined. 



Under the theory of Lillo et al. (20051, long-memory requires /3 < 2. 

This is the same impact vs. size derived by Gabaix et al. (20061. Their derivation is based on quite 



different reasoning, and requires mean- variance utility with a linear (rather than the usual quadratic) risk 
aversion term. They predict a different permanent impact. 
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FIG. 2: An illustration of predicted market impact for Pareto distributed metaorder size. Top 
panel: Immediate impact It (black circles) and permanent impact It (red squares) for f3 = 1.5. The 
dashed line is the price profile of a metaorder of size A'^ = 20, demonstrating how the price reverts 
from immediate to permanent impact when metaorder execution is completed. The inset shows a 
similar plot in double logarithmic scale for a wider range of sizes (from = 1 to = 1000). The 
blue dashed line is a comparison to the asymptotic square root scaling. Bottom panel: Expected 
immediate impact It as a function of time t for tail exponents f3 = 1,1.5,2 and 2.5, illustrating 
how the impact goes from concave to convex as (3 increases. 
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Eqs. (21) and (20) imply that the ratio of the permanent to the immediate impact is 

For example if /3 = 1.5 the model predicts that on average the permanent impact is equal to 
two thirds of the maximum immediate impact, i.e., following the completion of a metaorder 
the price should revert by one third from its peak value. 



C. Effect of maximum order size 

In the previous section we have assumed that ^ M, so that we can treat the problem 
as if M were infinite. In real markets the maximum order size is probably quite large, a 
significant fraction of the market capitalization of the asset. Thus we doubt that the finite 
support of has much practical importance, except perhaps for extremely large metaorders. 

From a conceptual point of view, however, having an upper bound on metaorder size 
creates some interesting effects. As we have already mentioned. Proposition 1 fails to hold 
when M = cxD, so this must be handled with some care. In Appendix C we illustrate how the 
results change when A^ M. What we observe is that when A^ < M/2 the impact is roughly 
unchanged from its behavior in the limit M — )• oo, but when A^ ^ M/2 the impact becomes 
highly convex. This is caused by the fact that the market maker knows the metaorder must 
end when A^ = M . Since by definition Vm = 0, the martingale condition requires that 
Sn+1 = Sn, i.e. there is no reversion when the metaorder is completed. This propagates 
backward and when A^ M it significantly alters the impact, as seen in Figure [3j 

Nonetheless, from a practical point of view we do not think this is an important issue, 
which is why we have relegated the details of the discussion to Appendix C. 



D. Market impact for stretched exponential metaorder size 

Changing the metaorder distribution has a dramatic effect on the impact. While we 
believe that the Pareto distribution is empirically the correct functional form for metaorder 
size, to get more insight into the role of we compute the impact for an alternative 
functional form. 

For this purpose we choose the stretched exponential, which can be tuned from thin 
tailed to heavy tailed behavior and contains the exponential distribution as a special case. 
There is no simple expression for the normalization factor needed for a discrete stretched 
exponential distribution, so we make a continuous approximation, in which the metaorder 
size distribution is 

The normalization factor r[a,z) is the incomplete Gamma function. The shape parameter 
A > specifies whether the distribution decays faster or slower than an exponential. (A > 1 
implies faster decay and A < 1 implies slower decay.) For short data sets, when A is small 
this functional form is easily confused with a power law. 
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It can be shown that the stretched exponential leads to an immediate impact function 
that for large t asymptotically behaves as 



(24) 



This is the product of a power law and an exponential; for large t the exponential dominates. 
The permanent impact is 



'^^nJ A^^A^ A^' 

where E„{z) is the exponential integral function and in the last approximation we have used 
its asymptotic expansion. The ratio of the permanent to the immediate impact of the last 
transaction is 

In contrast to the Pareto metaorder size distribution this is not constant. Instead the ratio 
between permanent and immediate impact decreases with size, going to zero in the limit as 
N ^ oo. The fixed ratio of permanent and immediate impact is a rather special property 
of the Pareto distribution. 



VII. DISCUSSION 
A. Comparison to the Kyle model 

The three types of agents in our model are similar to Kyle's; his informed trader is 

replaced by our long-term traders, and his noise traders are replaced by our day traders. 
In both cases we assume a final liquidation. There are also several key differences. In our 
model: 

• Our long-term traders do not have a monopoly, but rather have common information 
and compete in setting the size of their orders. Their orders are bundled together 
and executed as a package by an algorithmic execution service. These two facts are 
essential to show that the fair pricing condition is a Nash equilibrium. 

• The number of periods N for execution is uncertain, and depends on the information 
of the long-term traders. This is important because the martingale condition is based 
on the market makers' uncertainty about when the metaorder will terminate. 

• The distribution p{a) is arbitrary (whereas Kyle assumed a normal distribution). Our 
key result is that information is almost fully reflected in metaorder size, i.e. that 
Pn and p{a) are closely related. This means our results apply to any empirical size 
distribution p^. 

• Our day traders respond to ongoing information signals that are permanent in the 
sense that they affect the final price, in contrast to Kyle's noise traders, who are 
completely uninformed. This means that the information in the final liquidation price 
is incrementally revealed. 
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• The most important difference is that our model has a different purpose. Kyle assumed 
an information signal and solved for the optimal strategy to exploit it. We assume 
metaorder execution may or may not be going on in the background, and solve for its 
impact on prices. 



B. Empirical implications 

The theory presented here makes several predictions with clear empirical implications. In 
this section we summarize what these are and outline a few of the problems that are likely 
to be encountered in empirical testing. 



1. 



The fair pricing condition, Eq. [T0| is directly testable, although it requires a somewhat 
arbitrary choice about when enough time has elapsed since the metaorder has com- 
pleted for reversion to occur. (One wants to minimize this time because of the diffusive 
nature of prices, but one wants to allow enough time to make sure that reversion is 
complete). 

2. The asymmetric price response predicted by Eq. [7]is testable. However, this only tests 
the martingale condition, which is the less controversial part of our model. 

3. The equivalence of impact as a function of time and size is directly testable. Under our 
theory, for N > t the immediate impact from the first t steps is the same, regardless 
of A^. This is in contrast to the Kyle model which predicts linear impact as a function 
of time, but can explain concavity in size only by postulating variable informativeness 
of trades vs. metaorder size. 

4. The prediction of immediate and permanent impact based on p^r is directly testable 



through Equations 13 and 14 



If the metaorder distribution is a power law (Pareto distribution), then for large N the 
immediate impact scales as It ~ and the ratio of the permanent to the immediate 
impact of the last transaction is In /In = 1//3. See Section VI B 



Prediction (2) has been tested and confirmed by Lillo and Farmer (2004), Farmer et al 



prediction (5). 



(2006) and Gerig (2007). Preliminary results seem to support, or at least not contradict. 



The only studies of which we are aware that attempted to fit functional 

and Moro et al. 



form to the impact of metaorders are by Torre (1997), Almgren et al. 2005 



(2009); they find immediate impact roughly consistent with a square root functional form. 



Moro et al. also tested the ratio of permanent to immediate impact and found 0.51 for the 
Spanish stock market and 0.69 for the London stock market, with large error bars. To our 
knowledge the other predictions remain to be empirically tested. 



C. Information revelation 

Though the market makers in our model are uncertain whether or not a metaorder is 
present, if it is present, they know when its execution begins and ends. The ability to detect 
metaorders from imbalances in order flow using brokerage codes has been demonstrated 



[(Vaglica et al. 2008), (Toth et al 2010)]. A recent study of metaorders based on brokerage 
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code information found average participation rates of 17% for the Spanish stock market 
(BME) and 34% for the London Stock Market^^, for metaorders whose average size was just 
under 100 in both markets, making such metaorders difficuh to hide. The detection problem 
introduces uncertainties in starting and stopping times that may affect shape of the price 
impact. 

D. Final thoughts 

The traditional view in finance is that market impact is just a refiection of information. 
This point of view often goes a step further and postulates that the functional form of impact 
is determined by behavioral and institutional factors, such as how informed the agents are 
who trade with a given volume. This hypothesis is difficult to test because it is inherently 
complicated and information is difficult to measure independently of impact. Within the 
framework developed here, such anomalies would violate the fair pricing condition. 

In this paper we embrace the view that impact reflects information, but we show how at 
equilibrium the trading volume reflects the underlying information and makes it possible to 
compute the impact. The metaorder size distribution determines the shape of the impact 
but docs not set its scale. Metaorder size has the important advantage of being a measurable 
quantity, and thus predictions based on it are much more testable than those based directly 
on information. 

The fair pricing condition that we have derived here may well hold on its own, even 
without informational efficiency. This could be true for purely behavioral reasons: The fair 
pricing condition holds because it can be measured reliably, and both parties view it as fair. 
Thus while the main results here are consistent with rationality, they do not necessarily 
depend on it. 

We provide an example solution for the Pareto distribution for metaorder size because 
we bcficvc that the evidence supports this hypothesis. This gives the simple result that the 
impact is a power law of the form Xt ~ t^^^, and the ratio of permanent impact to the 
temporary impact of the last transaction is In /In — ^/ However, the bulk of our results 
do not depend on this assumption. Thus the reader who is skeptical about power laws may 
simply view the results for the Pareto distribution as a worked example. 

The strength of our approach is its empirical predictions. Because these involve explicit 
functional relationships between observable variables they are strongly falsiflable in the 
Poppcrian sense. A prcHminary empirical analysis seems to support the theory, but the 
statistical analysis so far remains inconclusive. We look forward to more rigorous empirical 
tests. 
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Appendix A: Proofs of the propositions 
Proposition 1. The martingale condition implies zero overall profits, i.e. 



M 



El [Nt^n] = PnNtin = 0. (Al) 

Af=l 

Proof. Given the definition of Rt and Rt we can write the prices as 

t-i 

St = So + Y,R^ (A2) 

t-i 

St+i = So + Y,R^-Rt (A3) 



i=0 

With these expressions for < M, tcj^ can be rewritten as 

1 ^ - 1 . 

ttat = — ^ St — Sn+1 = Rn — ^ iRi = 
t=i 1=1 

= -Rn Y^P^-nY.'^^ (A4) 

i=N+l i=l 

where in the last equahty we have used the martingale condition of Eq. ([T]). For N = M 
the profit per share is 

^ M 1 ^ 



i=l i=l 



^ M-1 

-M Y 



i=l 



By substituting these two last expressions in Eq. (Al) we obtain 

Af-l M M-l N~l M-1 

EiIn-kn] = J2 Yl p^-Yp^Y. ' p^' Y = 

N=l i=N+l N=l i=l i=l 

M-l M M N-1 

= Y Y pj-Yp^Y 

N=l j=N+l N=l i=l 

By explicitly computing the coefficients of each it is easy to show they vanish for each 
i, i.e. Ei[Nttn] = 0. 

Infinite support. This proposition does not hold when a has infinite support, in order 
to show this let us consider the expected profit for orders of length between = 1 and 
N = N. The following proposition holds: 
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Proposition 1'. The martingale condition for all intervals implies that for any integer 
N>1, 

N / ^ \ / ^ \ 

N=l \i=N+l / \i=l J 

This equation holds both for finite and infinite support (i.e. M can be finite or infinite). 



Proof. From the equation (A4) in tlie previous proposition, we know that martingale 



condition allows us to write the profit per lot traded as 

^ Af ^ N-1 

TTat = Rn Pi - ITT ^-Ri 

i=N+l i=l 

Therefore the expected profit for orders shorter or equal to iV < M is 

N N M N N-1 

^ PnNttn = ^ NRn ^ Pi - ^ Pat ^ i^i = 

N=l N=l i=N+l N=l i=l 

AI \ / N 



,i=N+l I \i=l 



This is equal to the quantity in Eq. (A7). Moreover it is clear that both terms in brack- 
ets are non-negative. This means that the market maker typically makes profits on short 
metaorders. 

If the support of pat is infinite then the martingale condition at all intervals implies that 

oo / °° \ / ^ \ 

E^IN-Kr,] = Y^PnNtin = Jim E E*^' • ^^9) 

N=l \i=N+l J \i=l J 

In the infinite support case the behavior of the limit in the last term of the above expression 
depends on the asymptotic behavior of and Rn for large N. This is due to the fact 
that for large N the first term in brackets goes to zero while the second term diverges. It 
is possible to construct examples where EiINun] goes to zero, to a finite value, or diverges. 
This result shows that in the infinite support case the martingale condition does not imply 
zero overall immediate profits. 

Proposition 2. // the second derivative of the immediate impact Xt is bounded strictly 
below zero, in the limit where the number of informed traders JC — )• oo, any Nash equilibrium 
must satisfy the fair pricing condition ttn = for 1 < N < M . On average market makers 
profit from orders of length one and take (equal and opposite) losses from orders of length 
M. 

The strategy of the proof is to show that counterexamples for which ir^ 7^ result in 
contradictions. First consider a candidate equilibrium with ttn < for some value of N, 
where 1 < < M. Assume a long-term trader k buys shares at an average price 
{J2tLi ^t)/N. After averaging over rjt the shares are subsequently valued at price S'tv+i = 
So + a. The profit is 



t=l J 



nfc(a) = Tifc I 5'o + a - — > ^ 5*4 I = -Uk-K^ > 
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If long-term trader k increases her order size by one share while all others hold their order 
size constant, her profit becomes 



1 \ 



The change in profit can be written 

An = U',{a) - U,{a) = -vr^ - ( ] { Sn+i - ^ } ] St ] . (AlO) 




The first term on the right (— ttat) represents the additional profit if it were possible to trade 
one extra share at the same average price, and the second term represents the reduction in 
profit because the average price increases. 

Since there is nothing to distinguish the long-term traders, the equilibrium must be 
symmetric, i.e. they all make the same decision. Thus if there are /C long-term traders 
buying shares and a day trader buying a random number of shares rjt, at equilibrium 
N = Uk- Thus if K is large, is also large. In the limit as K is large the second term 
vanishes if 

lim ( a J \ 1 I 5,v+i - ^ > I = 0. 



This is true providing the second derivative of the function St is bounded strictly below 
zero. Thus in this limit AH = — tttv > and the candidate equilibrium fails because the 
informed trader has an incentive to deviate. Similarly if vtjv > the informed traders take 
a loss which can be reduced by trading less. 

When tttv = (and as before 1 < < M) no informed trader has an incentive to change 



her order size. This is clear since in Eq. (AlO) with vr^v = the change in profit is given 



by the second term alone, which is always negative. A similar calculation shows that this is 
also true for decreasing order size, i.e. when tttv = 0, — )■ — 1 causes All < . 

The cases iV = 1 and N = M have to be examined separately because in these cases 
the fair pricing condition is incompatible with the martingale condition and informational 
efficiency (i.e. with the conditions on the final price). For = 1 the market makers' profit 
is 

TTi = 5*1 — 5*2 = -Rl, 



and from (Eq. |6]) the martingale condition is 

Vi{S2 - Si) + (1 - Vi){S2 - Si) = Vi{S2 - S2) + ^2 - 5i = 0. 

Thus if Vi 7^ 0, satisfaction of both the martingale condition and the fair pricing condition^^ 
requires that Si = S2 = S2, or equivalently that i?i = i?i = 0. In other words, if both 
conditions are satisfied then both the permanent and the temporary impact on the first step 
are identically zero, which would violate informational efficiency since a > 0. In Section W\ 
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In different terms, tfie incompatibility of fair pricing and tlie martingale condition was pointed out by 



Glosten (19941 
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we show by construction that this holds for all A^, i.e. it is clear in Eq. (13) and (14) that the 
impacts and /at are identically zero if -Ri = 0. To have sensible impact functions we must 
have i?i = TTi > 0, which means that market making is profitable on the first timestep^^. 

Similarly if = M the martingale condition implies Sm = Sm+i, i-e. no reversion, and 
since St is an increasing function the market maker takes a loss 



M M 



Mttm = 5^ 5i - MSm+1 = 5^(5, - Sm) < 0. 

t=l i=l 

Assuming ttjv = for 1 < A^ < M, the market makers' profit tti and loss Mttm are related 
by Eq. (g as 

tiiPi + MttmPm = 0. (All) 

For realistic size distributions we expect metaorders of size one to be much more common 
than those of size M, i.e. pi ^ pm- The ratio of the total profits is 



TTi Pm 

Thus the market maker receives frequent but small profits on metaorders of length one and 
rare but large losses for metaorders of length M. 

Long-term traders will rationally abstain from taking a loss on metaorders of length 
A^ = 1 by simply not participating when they receive a signals that are too weak; thus, the 
trading volume at A^ = 1 is due entirely to the day trader. Similarly, although tcm < 0, the 
long-term traders are unable to improve their profits by trading more, since we have bounded 
the total amount an individual can trade at (M — 1)//C so they are blocked from further 
increase. Thus the violations of the fair pricing condition when A^ = 1 and N = M occur 
naturally due to the institutional constraints that we have assumed and do not invalidate 
the equilibrium. 

Proposition 3. The system of martingale conditions (Eq. and fair pricing conditions 



(Eq. 10) has solution 



1 Pt l~Pi ~ 

l^i=t+l Pi l^i=t Pi 



Rt = -^^ ^^M^Ri t = 2,3,...,M-l (A12) 



^t = T^^* t = 1,2,...,M-1 (A13) 
i — Ft 



Proof. The solution of Eq. (A13) is a direct consequence of the martingale conditions 
(Eq. [6]). The total profit of metaorders of length N < M can be rewritten as (see proof of 
Proposition 1) 

i=l ^ i=l 



Ntt 



N 



NR 



N 



One might be tempted to naively conclude that market makers can defect from the equilibrium by simply 
trading orders only of length 1, so that they always make a profit. This is false: The profit from a 
metaorder of length one is tti = Si — {Sq + a), where a is a small number. In contrast, if a market maker 
participates only in the first trade of a large metaorder, her profit is t:[ = Si — {So + a'), where a' is a 
large number. Thus while tti > 0, tt^ < 0. 
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The fair pricing conditions (Eq. 10) state that for 1 < < M it is Nttn = 0, i.e. 

N-l 



R 



1 1-V 



N 



N 



N V 



N 



(A15) 



1=1 



This is a recursive equation which determines Rn once Ri is given (note that this equation 
does not hold for = 1 because we do not have fair pricing for metaorders of length one). 
The solution of this equation is 



Ri 



11 -Vt 



1 



-Ri 



t > 1 



(A16) 



t Vt V{P2V^....Vt-i 

and we prove it by induction. We assume that the solution holds for A^ = 2, 3, — 1 and 
we prove that it is true for N = t. If Eq. (A16) holds for A^ = 2, 3, .., t — 1 we can rewrite 
Eq. ([AT5| for A^ = t as 



Rt 



I I -Vt 
t Vt 



t-1 



i=l 



I I -Vt 
t Vt 



t-1 



1 + E 



1-V,. 



1 



i=2 



V, ViV2Vi....V,.i 



Ri 



(A17) 



Now by expanding the sum in brackets it is direct to show that 



1 + E 



1-V,. 



1 



1=2 



Vi ViV2V^....V,-i 



1 



1 



+ 



1 



Vi ViV2V^....Vt-i 



(A18) 



Since, by definition, Vi = 1 the first two terms in the right hand side cancel and thus one 
obtains Eq. (A16). This equation is equivalent to Eq. (A12). In fact 

1 



Rt 



11 -Vt 



-Ri 



Pt 



t Vt ViV2Vs....Vt-i tyf^^^p^m^PimAjPi ^fitP^ 



^1 



Pt 



EM T 



^Ei=t+iPi Ei.=tP 



^-P'R, (A19) 



l^i=t+l Pi l^i=t P^ 



i.e. our thesis, Eq. (A12). 



Appendix B: Effect of finite M on impact 



As already discussed briefly in Section VI C , if the condition A^ <^ M is violated this has 
an effect on the impact. In this section we consider the exact case of a finite support Pareto 
distribution. We show that when A^ ^ M we obtain the same results of the previous section 
and we discuss what happens when A^ ^ M. 

We assume that the metaorder size distribution is a truncated Pareto distribution for all 
N <M, i.e. 

»^r = ^ N > 1 (Bl) 
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where the normahzation constant H^j^^"^ is the harmonic number of order 1 + /3. For the 
truncated Pareto distribution the probabihty Vt that a metaorder of size t will continue is 

P ^ C(l + /3,t+l)-C(l + /3,M + l) 

C(l + /3,t)-C(l + /3,l + M) ^ ' 

where C{s,a) is the generalized Riemann zeta function (also called the Hurwitz zeta func- 
tion). For small t the function Vt increases meaning that it is more and more likely that 
the order continues. In the regime of t ^ M, Vt is well approximated by the expression of 
Eq. ( [Is) ) for an infinite support Pareto distribution. However, around t ~ M/2, Vt starts 
to decrease meaning that it becomes more and more likely that the order is going to stop 
soon, with a corresponding effect on the impact. 

The immediate impact can be easily calculated once the distribution of metaorder size 



is known by using Eq. (13). For truncated Pareto distributed metaorder sizes, Rt is (for 
t > 2) is 

p ^ ( ' - 1 \ A ,B3^ 

' I (C(l + /3.«)-C(l + /?,M + l))(C(l + fti+l)-C(l + ftA/ + l)i i^+f * ' 



For large t but t <^ M it is 



which is the same scaling as the infinite support Pareto distribution (see Eq. (20)). The 
same holds true for the permanent impact. We have therefore shown that when t -C M the 
finite support of the metaorder size distribution is irrelevant and we obtain approximately 
the same results as in Section IVIBI 

The finite size effects and the role of the finiteness of the support becomes relevant when 
t ^ M/2. Figure [3] shows the total impact for M = 1000 and different values of /3. It is clear 
that the impact is initially described by a power law, but then it becomes strongly convex 
when the order length becomes comparable with the maximal length. 
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FIG. 3: Immediate impact when the metaorder size has finite support, i.e when N has a maximum 
value M. Plot is in log log scale with M = 1000 and /? = 1.5, 2, and 2.5. 



